Name | Version | Summary | date |
---|---|---|---|
shtec-rlhf | 0.0.2.dev0 | shtec-rlhf: Safe Reinforcement Learning from Human Feedback | 2024-04-19 03:10:53 |
trl | 0.8.4 | Train transformer language models with reinforcement learning. | 2024-04-17 15:16:50 |
hour | day | week | total |
---|---|---|---|
94 | 1584 | 9498 | 204120 |